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1. Introduction 

It is well-known, that the special theory of relativity can be derived without the speed 
of light axiom, see for example [JJ, [2], [3], [3] and follow the citations in these papers, 
some of them dating back to Kaluza 1924 and Ignatowski 1910. However, the standard 
texts on the special and general relativity like [5], [6], [7j, [8], [9] don't mention this at 
all. Only [10], [3] and [11] discuss this issue, but except for the last one, the treatment 
is still too much complex. 

It was reported that the possibility of deriving the special relativity without the 
speed of light axiom was discovered many times in the past [2], without realizing that 
this was already clear to Albert Einstein at the time of the writing [12]. 

The best argument on this issue is given in [IT]: either particles can be accelerated 
to arbitrary speeds, or they cannot. If they can, we get the Galileo transformation, 
if they cannot, then there must exist, mathematically speaking, a least upper bound 
c to particle speeds in any one inertial frame. By the relativity principle, this bound 
must be the same in all inertial frames, moreover, the speed c - whether attained or 
not by any physical effect - must transform into itself (otherwise we could get higher 
speed than c of some particle when transformed from S to S'). But when c transforms 
into itself, we are lead uniquely to the Lorentz transformation by the usual procedure 
employed in most of the texts. Thus the relativity principle by itself necessarily implies 
that all inertial frames are related either by Galilean transformations, or by Lorentz 
transformations with some universal c. The only role of the speed of light axiom is the 
determination of c. 

However it is not really intuitive that particles cannot be accelerated to arbitrary 
speeds. On the other hand, the fact that we will allow any possible transformation 
between S and S' (and derive the only two allowed possibilities) is much more plausible. 
It is of course equivalent, but the latter approach is more explicit. 

The basic principles which the Newtonian theory (and also the special theory 
of relativity) is built on are homogeneity, isotropy and the principle of relativity. 
This allows two and only two possible transformations: Galilean and Lorentzian. 
Experimentally the Galilean is not satisfactory for many reasons (the apparent speed 
of light limit and other problems), so we need to take the Lorentz one. There is no 
other option left, unless we want to sacrifice the principle of relativity or homogeneity 
or isotropy. 

Almost every aspect of this issue can already be found in the literature. However, 
what the author couldn't find, is a derivation of the special relativity in a rigorous, 
but simple, short and clear way. The amount of rigor is subjective, also some of the 
assumptions can be weakened, or made more precise, but what we want to achieve in 
this article is to choose some small amount of assumptions, put them into equations and 
from that point only work with the algebra. See the references, for example [13], [13] 
for a thorough description of what postulates are necessary and which can be weakened 
and also for a review of all the derivations of the Lorentz transformations known to the 
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author of [T3] , [2] until 1997 (together with his own new derivation — but we present 
a shorter one in this paper). 

Many articles (see the citations in JlJ) first derive the velocity addition law and 
the arguments are quite messy, referring to pictures many times [I], or nitpicking in 
unnecessary mathematics [3] , [15] , [16] , [IT] etc. The articles [IE] , [19] are very good and 
cover almost everything which is shown in this article, however they also concentrate on 
quite unimportant mathematical details and some of their derivations are unnecessarily 
complicated and long. The best approach known to the authors is [lj that derives the 
Lorentz transformation and the velocity addition law using a very clear arguments, first 
writing down algebraic relations that are equivalent to homogeneity, isotropy and the 
relativity principle and the rest is a pure algebra. He works in 2D spacetime though 
and only derives the Lorentz boost. 

In this paper we try to use the same, nice and simple arguments of P, but using the 
results from [15] . [T9] and [3], thus deriving everything in 4D spacetime and not only 
showing how to get the Lorentz boost, but also that all permissible transformations 
obey the orthogonal property, thus proving the invariance of the spacetime interval. 
And it is well-known, that the whole special theory of relativity can be derived from the 
invariance of the interval. 

2. Derivation of the transformation 

This short section is the main result of the article. The other sections are just more 
detailed explanations and discussions. 

Let's have two Cartesian systems S and 5", where S' is moving with the velocity 
v along the x-axis and at the time t — 0, S — S' (in other words the y and z axes of 
both systems are parallel and the x axes are the same, except that the origins x = 
and x' = are moving with the speed v with respect to each other: when x = vt, then 
x' = 0). 

We need to assume the homogeneity, isotropy and the principle of relativity. 
In sections 13.11 13.21 13.31 we show in detail, that these very general and "obvious" 
assumptions can be written mathematically using the following equations (Tl~a]) - (TTTzT) (if 
some of them look unintuitive or confusing, look into the sections 13.11 13.21 13.31 for the 
thorough derivation and explanation): 
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where A(v) is a matrix 



A(v) 



( ^oo Ooi ^02 ^03 \ 

aio an a 12 ai3 

a20 ^21 ^22 «23 
V &30 a 31 a 32 a 33 / 

and the coefficients only depend on v (homogeneity). We require 
A(0) = 1 

and also (relation between origins and parallel axes) 

x' — when x = vt, y = 0, z = 

x' — y' = when x = 0, y = 0, 2 arbitrary 

For each t> (relativity): 

= 1 

For each u and f there exist w such that (relativity): 

A(u)A(v) = A{w) 
For each v there exist v such that (isotropy) 

TA(v)T = A(v) 
where the matrix T is 
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For each v and each a (isotropy): 

R(-a)A{v)R(a) = A(v) 
where the matrix R(a) is: 
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In Appendix A it is shown, that by a pure algebraic manipulation, the above 
assumptions directly imply that 
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where K is an arbitrary constant independent on v. This is the Lorentz (K > 0) and 
Galilean (K = 0) transformation. 

Let us first review the equations above to see that they really are what we mean 
by the homogeneity, isotropy and the principle of relativity. And then we'll discuss the 
above result more thoroughly. 



3. Assumptions 

3.1. Homogeneity 

The most general transformation from S to S' is: 
t' = T(t,x,y,z,v) 
x = X(t, x, y, z, v) 
y' = Y(t,x,y,z,v) 
z = Z(t,x,y,z,v) 
The length of a rod put on the x-axis in the frame S is 

I = x 2 — X\ 

and in the frame S' the length will generally be different: 

I' = x' 2 -x\ = X(t,x 2 ,0,0,v) - X(t,X!,0,0,v) 

Homogeneity means, that if we move the left end of the rod in the frame S from x\ to 
X\ + h, the right end will move to x 2 + h giving the same length I = (x 2 + h) — (xi + h) — 
x 2 —x\ and that in the frame S' the new length I' = X(t, x 2 +h, 0, 0, v)—X(t, x±+h, 0, 0, v) 
will also be the same as before: 

X(t, x 2 , 0, 0, v) - X(t, xi, 0, 0, v) = X(t, x 2 + h, 0, 0, v) - X(t, x x + h, 0, 0, v) 



so 



X(t, x 2 + h, 0, 0, v) - X(t, x 2 , 0, 0, v) = X(t, x x + h, 0, 0, v) - X(t, x u 0, 0, v) 
and dividing by h and taking a limit h — > 0: 



dX 
dx 



i,x 2 ,0,0 



dx 

dx 



t,xi,0,0 



but x\ and x 2 are arbitrary, so ^ is constant so X(t, x, y, z, v) is linear with respect to 
x. Similar procedure shows, that X(t, x, y, z, v) is linear with respect to y, z and t, and 
the same for Y, Z and T, which means, that 

/ t \ 
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y 

V z J 
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where 



A(v) 



( «00 ^01 «02 ^03 \ 

a 10 an ^12 ai3 

a 2 o «21 0-22 ^23 

V a 3 o a 3 i a 32 a 33 / 



and the coefficients a„ v only depend on v. This is the assumption (flal 



3.2. Principle of relativity 

The relativity principle means, that the functional form of the transformation A(v) is 
the same when transforming from S' to S. The S' has the speed v as seen from S, 
however, the reciprocal speed of S as seen from S' can be generally anything, so we 
denote it by <p(v): 



( t \ 

x 

y 

\ z ) 



A(<p(v)) 



( t> \ 

x' 

y' 



from which we get: 



x 

y 

V z J 



A(v(v))A(v) 



( t \ 

x 

y 

V z J 



or 



1 



A(<p{v))A(v) 

In our derivation, we assume (p(v) = —v (and we get the assumption (JTej) ). because it 
is natural. However, as is shown in [19], it is not necessary, but it adds a complexity 
to the derivation and our motive is not to find the weakest assumptions possible, but a 
reasonable set of natural assumptions, such that the Lorentz transformation inevitably 
follows from them. 

Now let S" be moving with a speed u with respect to S' . Then the relativity 
principle requires, that transforming from S to S' and then to S" is the same as 
transforming from S to S" directly (with some other speed w): 

A(u)A(v) = A{w) 



This is the assumption (If) 



3.3. Isotropy 

Isotropy of space implies (among other things), that the transformation doesn't change 
when we reverse the x-axis, i.e. that reversing the x-axis, applying the transformation 
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for the speed v and reversing the x'-axis again is the same as applying the transformation 
directly (but for some other speed v). The matrix that reverses the x axis is: 



So the above statement means: 



T 



(I \ 
0-100 

10 

V 1 / 



TA(v)T = A(v) 

This is the assumption ( [TgD . 

The isotropy also implies, that since the only significant spacial direction is that of 
the (x, x')-axis - the direction of motion - the transformation A(v) must be the same 
as if we first rotate about the (x, x')-axis, transform and then rotate back: 

R(-a)A(v)R(a) = A(v) 

where the R(a) is a matrix, that rotates the system around the x axis: 

\ 

n i n n 

R(a) 



( 1 
1 







COS Oi 






sin a 



\ — sin a cos a J 



And this is the assumption (fTTij) . 



4. Discussion 



In Appendix A it is shown, that the above equations imply 



A(v) 
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where K is a constant independent on v. 

It can be shown jl] that K < is inconsistent, so we set K = 4-, where c is a 
constant, independent of the frame of reference (because K is), with a dimension of 
speed (possibly c = oo) and we get our final formula: 
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For c = oo we get the Galilean transformation: 
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For c finite we get the Lorentz transformation, but the value of c is not determined by 
the theory and must be measured in experiment. 

For many centuries up to around 1905, it was known from an experiment, that 
the c is very high or possibly infinite and it couldn't be determined at that time, so 
setting c = oo was the correct answer (they didn't think this way, but they could if 
they wanted and even Galileo could have derived the special theory of relativity |20j). 
However now it's clear, that the theory gives the correct results, when we set c to be 
the speed of light (notice however, that in general, the c doesn't have to be the speed of 
light). So the speed of light axiom can actually be rephrased as: "Don't use the Galilean 
transformation, because it doesn't work, and if you get some maximum allowed speed 
in the theory, it is the speed of light" . 



5. Invariance of the spacetime interval 

It is easy to show, that the Lorentz transformation above (K > 0) obeys the 
orthogonality relation: 

r] = A T r]A 

where A is the Lorentz transformation matrix and rj = diag(— 1, 1, 1, 1) is the Minkowski 
tensor. Written using indices: 

Va/3 = V^a^fi (2) 

and it can also be shown, that any transformation defined by the orthogonality relation 
is either a boost (the transformation derived above), or spatial rotations, reflections of 
axes or translations (see any book on the quantum field theory, for example [21]). All 
of them are valid transformations between S and 5". So the orthogonality relation can 
be taken as the definition of all possible transformations between frames. 
Now we define the space time interval ds 2 by 

ds 2 = rj^ydx^dx v 

This is invariant for all transformations defined by the orthogonality relation (T5]): 

ds 12 = r] iiv dx'^dx' v = rj^A" a dx a A" 'pdx = r] aP dx a dx p = ds 2 

On the other hand, all the transformations that leave the interval invariant must be of 
the form (T5]), because 

dx'^ dx' u 

ds' 2 = ■ri uu dx' fJ 'dx' u = ri uu - — dx a -—^dx f3 = ds 2 = ri a0 dx a dx 13 

dx a dx l:> 
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This is true for all dx a and dx 13 , so we get: 



dx'v- dx' 



(3) 




dx p dx' 







But then 



dx'» 



A". 



dx 



a 



are constants (depending only on v) and Q are the orthogonality relations (j2j). In other 
words, the orthogonality relations are equivalent to the invariance of the interval. 

So the starting point to the special theory of relativity can be any of these (all of 
them are equivalent, as shown in this paragraph): 

• homogeneity, isotropy, the principle of relativity and the requirement, that we don't 
want the Galileo transformation 

• the orthogonality relation 

• invariance of the spacetime interval 

6. Conclusion 

We showed from the homogeneity, isotropy and the principle of relativity that the 
only possible transformations between S and S' are either the Galileo or Lorentz 
transformation, but nothing else. Contrary to other texts, we first wrote explicit 
equations and then only used a pure algebra to derive our result. 
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Appendix A. Derivation of the Lorentz transformation 

From ijTTij) we get by multiplying by R(a) from left: 



R(a)A(v) = A(v)R(a) 



This must hold for any a and in Appendix B it is shown, that 
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where 

kP{9) = 



k cos 9 k sin 9 
—ksm.9 kcos9 



for some values of the parameters k(v) and 9{v), that are functions of v. However, from 
ffTdj) we get (for all v and z): 





z' 



k cos 9 k sin 9 \ ( 
— ksm.9 kcos9 J \ z 



From which zk sin 9 = for all z, so /csin# = and that implies 

\ k(v) ) 

This k(v) can be positive, negative or zero. So now A(v) has this form: 
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where the constants A, B, C, D, E only depend on v, the direct velocity, and from (TTch 
we get = A(v)vt + B(v)t so 

B(v) , k s 

In other words, we can always determine the direct speed v from the matrix elements. 
From ( fT^D we get 



TA(v)T 



( D(v) 
-B(v) 


v o 



-C(v) 
A(v) 










E(v) 




A(v) 



\ 





E(v) J 
\ 



( D(v) C(v) 

B(v) A{v) 

E(v) 

V E(v) j 

Comparing the two matrices we see that B(v) = —B(v) and A(v 
from OA. ID we have v = — and v = — 4rr, but then v = —4?^ 
get these relations by comparing the matrix elements of the two matrices: 



A(-v) 
B(-v) 
C{-v) 
D(-v) 
E(-v) 



A(v) 

-B(v) 

-C{v) 

D(v) 

E{v) 



- A(v). However, 
4t4 = —v and we 

A(v) 



(A.2) 
(A.3) 
(A.4) 
(A.5) 
(A.6) 
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Using (ITej) and the symmetries f1A.2h - (1A.6|) we get: 



A(-v)A(v) 



multiplying: 



/ D(v) 
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o \ 
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(A.7) 
(A.8) 
(A.9) 
(A.10) 
(A.ll) 
1 (of course 



From ( 1A.11I) we get E(v) = ±1, but from (flfr!) we have E(0) = 1 so E(v ) 
we require that matrix elements are continuous). 

If for some v the A(v) ^ D(v), then B(v) = from (1A.8I) . thus A(v) = ±1 from 
(IA.7I) and from ( 1A.1I) we get v = — = 0, which means that A(0) ^ -D(O), but that is 
a contradiction with ffTB . that asserts A(0) = D(0) = 1. 

So we must have A{y) = D{v) for all v, then from ( 1A.7I) we get C{y ) = A and 
from (lA.lj) follows B(v) 



-vA(v): 
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A 2 -l 
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where A(v) is an unknown function of v, except that A(0) = 1 (follows from ( fT6j) ) 
we use (IT) 



Now 



A(u)A(v) 



1 A u 
-uA u 
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uA u 

A u 
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A v 
-vA v 





Al-l 
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Multiplying the matrices: 

f A U A V + (A^ 

A{u)A{v) = 



1) 



v A v 
uA u 



i uA u 



\ 








A U A V + (A u - l){ Av 
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1 / 
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and 



A(w) 



( A w 
-wA w 


v o 



A, n 








0^ 



1 
1, 



so comparing the two expressions for A w (the first and the second diagonal element) we 
get: 



A 2 



v 2 Al 



u 



'Al 



where the left hand side only depends on v, the right hand side only on u, thus both 
sides are equal to a constant K, that is independent of the frame of reference, because 
it doesn't depend on the coordinates or v, so we get (remember A(0) = 1, so we take the 
positive square root) 



Av VI - Kv 2 



and we arrive at the expression for the transformation between S and 5": 
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Appendix B. Rotations 

For each a, we have: 



R(a)A(v) = A(v)R(a) 



where 



R(a) 



( 1 \ 

10 

cos ex sin a 

V — sin a cos a J 



1 

P(a) 



P(a) 



cos a sin a 
— sin a cos a 



1 cos a + ia 2 sin a = e 



ZOttJ 2 



A(v) 



(A, A 2 
\A 3 A 4 

and the a±, o~i and a 3 are the Pauli matrices. Then 



R(a)A(v) - A{v)R{a) 



1 \ / A l A 2 
P(a) )\A 3 A 4 



A 1 A 2 
A 3 A A 



1 
P(a) 



A 2 (t-P(a)) 
[P(a) - l)A 3 P(a)A 4 - A 4 P{a) 
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The parameter a is arbitrary, so A 2 = A 3 = and (we set A4 = a l + aiai + a 2 a 2 + a 3 a 3 ) 

P(a)A 4 - A 4 P(a) = e iaa2 (a + a x ai + a 2 a 2 + a 3 a 3 ) - (a + + a 2 a 2 + a 3 a 3 )e iaa ' 2 = 

= e iaa2 (ai(Ji + a 3 a 3 ) - (a 1 a 1 + a 3 a 3 )e iarT2 = i sin a (a 2 (a 1 a 1 + a 3 a 3 ) - (a^ + a 3 a 3 )a 2 ) = 

= 2sina(aiO"3 — a3<Ti) = 
Multiplying by a 3 from the left and taking a trace we get 

Tr 2cr 3 sin a{a\a 3 — a 3 a x ) = 2 sin a(a 1 Tr 1 — ia 3 Tr a 2 ) = 

but Tr a 2 = and Tr 1 = 2 so a\ = 0. Similarly a 3 = 0. So 

A 4 = a + a 2 a 2 = ke ida2 = kP{6) 



where k = \Joq + a 2 , cos 9 = ^ and sin^ = ^. So the matrix A(v) can always be 
written as: 

for some values of the parameters k(v) and 9(v), that are functions of v. Note, that if 
we rotate the axes before doing the transformation: 

At no/ x Mi \ ( 1 \ Mi ° \ 

= [ ifcP((?) J ( P{a) ) = { ifcP(6l)P(a) J 
We see that by rotating around the x-axis by the angle a = —9, we get 

f M \ 



A(v)R(-9) 



fcl 



Geometrically this means, that the A(v)R(— 9) doesn't rotate the y and z axes (only 
scales them by a factor of k). 

References 

[1] Palash B Pal. Nothing but relativity. European Journal of Physics, 24(3):315-319, 2003. 

[2] Sebastiano Sonego and Massimo Pin. Deriving relativistic momentum and energy. European 

Journal of Physics, 26(l):33-45, 2005. 
[3] R Torretti. Relativity and Geometry. New York: Dover, 1996. 

[4] Brian Coleman. A dual first-postulate basis for special relativity. European Journal of Physics, 
24(3):301-313, 2003. 

[5] B F Schutz. A first course in general relativity. Cambridge University Press, 1985. 
[6] H Goldstein. Classical mechanics. Addison Wesley 1950. 

[7] R P Feynman. Feynman Lectures On Physics. Addison Wesley Longman, 1970. 
[8] K S Thorne, C W Misner, and J A Wheeler. Gravitation. W. H. Freeman, 1973. 
[9] S Weinberg. Gravitation and Cosmology: principles and applications of the general theory of 
relativity. Wiley 1972. 



Simple derivation of the special theory of relativity without the speed of light axiom 14 

[10] W G Dixon. Special Relativity: The Foundation of Macroscopic Physics. Cambridge University 
Press, 1940. 

[11] W Rindler. Introduction to Special Relativity. Oxford University Press, 1991. 

[12] A. Einstein. Zur Elektrodynamik bewegter Korper [AdP 17, 891 (1905)]. Annalen der Physik, 

14:194-224, February 2005. 
[13] J H Field. A new kincmatical derivation of the lorentz transformation and the particle description 

of light. Helv. Phys. Acta., 70:542, 1997. 
[14] J H Field. The physics of space and time i: The description of rulers and clocks in uniform 

translational motion by galilean or lorentz transformations, 2006. 
[15] V Gorini. Linear kinematical groups. Commun. Math. Phys., 21:150-63, 1971. 
[16] L A Lugiato and V Gorini. On the structure of relativity groups. J. Math. Phys., 13:665-71, 

1972. 

[17] T Matolcsi. Spacetime Without Reference Frames. Akademiai Kiado, 1993. 

[18] V Bcrzi and V Gorini. Reciprocity principle and the lorentz transformations. J. Math. Phys., 
10:1518-24, 1969. 

[19] V Gorini and A Zecca. Isotropy of space. J. Math. Phys., 11:2226-30, 1970. 

[20] A Sen. How Galileo could have derived the special theory of relativity. American Journal of 

Physics, 62:157-162, February 1994. 
[21] M Maggiore. A Modern Introduction to Quantum Field Theory. Oxford University Press, 2005. 



